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Abstract 

A sorting network is a shortest path from 12 • • • ?i to n • • • 21 in the 
Cayley graph of Sn generated by nearest-neighbour swaps. We prove 
that for a uniform random sorting network, as n — > oo the space- 
time process of swaps converges to the product of semicircle law and 
Lebesgue measure. We conjecture that the trajectories of individual 
particles converge to random sine curves, while the permutation ma- 
(yQ \ trix at half-time converges to the projected surface measure of the 

■ 2-sphere. We prove that, in the limit, the trajectories are Holder-1/2 
continuous, while the support of the permutation matrix lies within 

I a certain octagon. A key tool is a connection with random Young 

■ tableaux. 

O 

■ 1 Introduction 



Let Sn be the symmetric group of all permutations a = {a{l), . . . , a{n)) on 
{1, . . . ,n}, with composition given by (crr)(z) := cr(r(z)). For 1 < s < — 1 
denote the adjacent transposition or swap at location s hj Ts := { s s + 1) = 
, (1,2,..., s+1, s, . . . , n) G Sn- Denote the identity id := (1,2,..., n) and the 

reverse permutation p := (n, . . . , 2, 1). An n-element sorting network 
is a sequence u = {si, . . . , s^) such that 
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Figure 1: Selected particle trajectories for a uniformly chosen 2000-element 
sorting network. 



Figure 2: The permutation ma- 
trix of the half-time configuration 
c"Af/2 for a uniformly chosen 2000- 
element sorting network. 




2 




1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2 3 4 5 6 



Figure 3: Left: the "wiring diagram" of the 6-element sorting network oo = 
(1, 2, 1, 3, 4, 5, 2, 1, 3, 2, 1, 4, 3, 2, 1). The swap process is shown by the black 
discs. The trajectory of particle 3 is highlighted. Right: the graph (or 
permutation matrix) of the configuration a-i = (3, 4, 2, 5, 6, 1) of u at time 7. 



where 

N :-- 



2^ 

(It is easily verified that is the minimum possible length of a sequence of 
swaps whose composition is p, while p is the unique permutation for which 
this minimum length is maximized.) For 1 < A; < we refer to Sk = Sk{w) as 
the kth swap location, and we call the permutation = CTki^) '■= ^si ' ' ' 
the configuration at time k. We call cr^^{i) the location of particle i 
at time k, and we call the function k i— > cr^^(«) the trajectory of particle i. 
See Figures 1, 2 and 3 for some illustrations. 

Let Qn be the set of all n-element sorting networks, and let Pu = 
denote the uniform probability measure on Qn (assigning probability 
to each u G r2„,). We refer to a random sorting network chosen according to 
Pu as a uniform sorting network (USN). 

Our first results concern the swap locations. 

Theorem 1 (Stationarity and semicircle law). Let Un he a uniform 
n-element sorting network. 

(i) The random sequence (si,...,SAr) of swap locations is stationary; 
that is (si, . . . , sn~i) and (s2, . . . , sn) are equal in law under ¥u- 

(a) The first swap location Si satisfies the convergence in distribution 

2si{uJn)/n — 1 Z as n ^ oo 

where Z is a random variable with semicircle law; that is with proba- 
bility density function \\/\. ~ y"^ for y G (—1, 1). 
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Figure 4: A histogram of the swap process for a uniformly chosen 2000- 
element sorting network. (The height of each column represents the number 
of swaps in the corresponding space-time window.) 

In fact we can compute the exact distribution of si for each n; see Propo- 
sition 9. In addition we establish the following "law of large numbers" for 
the swap locations. For an ra-element sorting network u, define the scaled 
swap process rj = rjiuo) to be the measure 

k=l 

where y) is the point measure at (x, on R^. Figure 4 is a histogram 
of 7] for a uniform 2000-element sorting network. Denote the semicircle mea- 
sure by 5tra\{dy) := '^^Jl — dy, and Lebesgue measure on [0, 1] by 
Sjth{dx) := lxe[o,i] 

Theorem 2 (Law of large numbers). Let ujn he a uniform n- element 
sorting network. The scaled swap process rj satisfies 

rj{ujn) =^ iieb X semi as n ^ oo. 

Here =^ denotes convergence in distribution of random measures in the 
vague topology on Borel measures on M^, and the right side denotes the de- 
terministic product measure. 
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For a sorting network cu, define the scaled trajectory Tj(t) = Ti{t,uo) of 
particle i by 

Tiit) ■.= 2a;^{T)/n-l 
when tN is an integer, and by linear interpolation for other t G [0, 1]. 

Theorem 3 (Holder trajectories). Let ujn he a uniform n-element sorting 
network. 

(i) For any e > 0, the scaled trajectories satisfy 
Fl(Wz,s,t : \T,{t) - Ti{s)\ < v^|t-s|'^' + 



as n ^ oo. 



(a) Let Tj(„) be the scaled trajectory of an arbitrarily chosen particle 
i{n) = i{n,ujn) in uJn- Then the random sequence {Ti(^n)}'^=i has sub- 
sequential limits in distribution with respect to uniform convergence of 
functions, and any subsequential limit is supported on Holder(\/8, ^) 
continuous paths. 

For a uniform sorting network, the particle configuration ak at a given 
time is a random permutation. We prove the following bounds on its distri- 
bution. 

Theorem 4 (Octagon bounds). Let uJn be a uniform n-element sorting 
network. For any e > Q we have 



|o"fc(^) — "^1 < dk + en, and 
kfe(0 - in-i)\ < dN-k + en 



P'^ VA;,i ! . 1^1 as n ^ oo. 



where dk := n^^{2-^). 



Theorem 4 states that for each t, all the I's in the permutation matrix 
of the configuration a\tN\ he within a certain octagon asymptotically almost 
surely; see Figure 5. 

Results of [5] and [12] give rise to an efficient algorithm for exactly sam- 
pling a uniform sorting network (specifically, see Theorems 8 and 13 in this 
article). The resulting simulations, together with heuristic arguments, have 
led us to striking conjectures about the asymptotic behaviour of the uniform 
sorting network. 
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Figure 5: Graphs of the configurations at times 0, j^, -j^, . . . ,N for a uni- 
formly chosen 500-element sorting network. Also shown are the asymptotic 
"octagon bounds" of Theorem 4, and the conjectural asymptotic "ellipse 
bounds" implied by Conjecture 2. 



Figure 1 illustrates some trajectories for a uniform 2000-element sorting 
network. We conjecture that as n — >■ oo, all particle trajectories converge to 
sine curves of random amplitudes and phases. 

Conjecture 1 (Sine trajectories). Letujn be an n- element uniform sorting 
network and let Ti be the scaled trajectory of particle i. For each n there exist 
random variables (6")"=i such that for all e > 0, 



P?/ max max \TAt, cu„) - A" sinfvrt + 6") I > ^'l ^ 

V je[i,n] te[o,i] ' ' / 



as n ^ oo. 



Figures 2 and 5 illustrate the graphs {{i,ak{i)) ■ i G [1,?^-]} (i.e. the loca- 
tions of I's in the permutation matrix) of some configurations from uniform 
sorting networks. We conjecture that as n — >■ oo the graphs asymptotically 
concentrate in a family of ellipses, with a certain particle density in the in- 
terior of the ellipse. Define the scaled configuration Ht = Ht{^^) at time t 
by 



n ^-^ \n n ' 



=1 



We define the Archimedes measure with parameter t G (0, 1) by 



2ltcf)j((ix X dy) J sin^(7rt) + 2xycos{TTt) — — 



V dy. 
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Conjecture 2 (Archimedes configurations). LetuOn he an n- element uni- 
form sorting network. For all t G (0,1), the scaled configuration at time t 
satisfies 

fit{uJn) =^ 2trc[)i as n ^ oo. 

Here =^ denotes convergence in distribution in the vague topology for ran- 
dom Borel measures on M^. 

In the case t = 1/2, the measure 2trcf)i/2 has density 1/ (27r-\/l — — y^) 
on the circular disc x"^ + y'^ < 1. This is the unique circularly symmet- 
ric measure whose linear projections are uniform. It may be obtained by 
projecting surface area measure on the 2-sphere in onto — that this 
gives a measure with the aforementioned property follows from the observa- 
tion of Archimedes that the surface area of a sphere between two horizontal 
planes equals the corresponding area of a circumscribed vertical cylinder. 
(The claimed uniqueness follows from uniqueness of the characteristic func- 
tion, [13, Theorem 5.3]). For general t the measure Slrcf)^ is obtained from 
Qlxci)i/2 by the linear transformation {x,y) (x,xcos(7rt) +ysm{7it)), and 
is supported on the interior of an ellipse — see Figure 5. 

Conjectures 1 and 2 (and more) are implied by a very natural conjecture 
about the geometry of uniform sorting networks. The permutahedron is 
the natural embedding of the Cayley graph (iS„, {Ti)1~^) in Euclidean space 
in which we assign the permutation a G iS„ to the point 

= {a~\l),...,a~\n)) G M". 

For all a G 5„, clearly lies on the {n — 2)-sphere 

§„ := {zeR-: Er=i^^ = ^} n G M" : ELi = "("+^f"+^^ }, 

while id~^ and are antipodal points on S„. Furthermore each edge of the 
Cayley graph has Euclidean length ||o"~^ — (crrj)~^||2 = See Figure 6 for 
an illustration of the case n = 4 (where S4 is a 2-sphere) . A sorting network 
corresponds to a shortest path from id~ to in the Cayley graph. It is 
natural to guess that such a path might typically be close to a great circle 
of §„; that is, a Euclidean circle in having the same centre and radius 
as §„. We show that, if a sorting network lies close to some great circle, 
then its trajectories are approximately sine curves, its particle configurations 
approximate Archimedes measure, and its swap locations are approximately 
governed by the semicircle law. 
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Figure 6: The permutahedron for n = 4. 



Theorem 5 (Great circles). Suppose for eachn that Un is a (non-random) 
n-element sorting network, and suppose that there is a sequence of great 
circles Cn C S„ such that 

d^{uJn, Cn) = o{n) as n ^ oo, 

(with distance defined as dooi^^.c) := maxjg[i 7v] inf^gc ~ -^Hooy'- Then: 
(i) there exist a^,0^ such that the scaled trajectories satisfy 

max max Tj(t, Un) — a? sinfyrt + O"") 
ie[i,N]te[o,i] 

(a) for all t G (0, 1), the scaled configuration satisfies the vague conver- 
gence 

UtiyJn) ==> Slrcf)^ as n ^ oo; 
(Hi) the scaled swap process satisfies the vague convergence 
Tjiun) => iieb X semi as n ^ oo. 

We conjecture that, asymptotically almost surely as n — >■ cxd, the uniform 
sorting network does indeed lie close to a great circle on the permutahedron. 
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Conjecture 3 (Great circles). Let uOn he an n- element uniform sorting 
network. For each n there exists a random great circle Cn C such that 

doo{uJn, Cn) = o{n) in probability as n ^ oo. 

Simulations provide overwhelming numerical evidence in support of Con- 
jecture 3. Indeed, the evidence suggests that for the optimum great circle, 
typically doo{uJn,Cn) ~ const x where a ~ 1/2. For example, an exact 
simulation of a 10000-element uniform sorting network gave (ioo(i^ioooo; c) < 
159 for a certain great circle c. If Conjecture 3 holds then, by Theorem 5, 
Conjectures 1 and 2 follow, as well as the result in Theorem 2. The fact 
that Theorem 2 does indeed hold thus provides some further circumstantial 
evidence for Conjecture 3. It is interesting that the proofs of Theorem 2 and 
Theorem 5(iii) use entirely different methods. 

In addition to Theorem 4, we note that certain other special permutations 
may be shown to have asymptotically much lower probability than others. 
Since the number of permutations is n!, at any given time step k G [0,A'"] 
there must exist some permutation which is visited with probability at least 
l/n\ > exp[— nlogri]. However, some permutations are much less likely, as 
illustrated by the following. 

Example 6. For n even, let h = N/2 — n/4, and consider the permutation 
ip := I — 1, . . . , 1, 77,, n — 1, . . . , I + 1^ The probability that the uniform 
sorting network passes through particle configuration ip equals 

Fl{ah = V) = exp [ - + 0{n)] as n ^ oo. 

(This will be verified in Section 3.) 



Remarks 

History and connections. Sorting networks were first considered by 
Stanley [21], who proved the remarkable formula 

(2)' 

- p3n-i5n-2...(2n-3)i' ^'^^ 

Another breakthrough was achieved by Edelman and Greene [5], who ob- 
tained a bijective proof of (2). (A related approach to the enumeration of sort- 
ing networks was independently developed by Lascoux and Schiitzenberger; 
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see [17], [10, p. 94-95].) The Edelman-Greene bijection is between the set Qn 
of sorting networks and the set of all staircase-shape standard Young tableaux 
of size n. This bijection will be an important ingredient for our results; we 
describe it in Section 4. See [7, 10, 11, 18, 20] for further background. 

Sorting networks are of interest in computer science, since they can be 
interpreted as networks of comparators capable of sorting any sequence into 
descending order; see [16, Exer. 5.3.4.36-38]. There is also a connection with 
change- ringing (English- style church bell ringing); for background see [23] 
and the references therein. 

About the proofs. The proof of Theorem l(i) is very simple, and the 
proof of (ii) is straightforward given the results of [5]. Similar computations 
appear in [20]. Our proofs of Theorems 2, 3 and 4 are more involved, and 
depend on results from [19] on limiting profiles for random Young tableaux. 
A key tool is an extension of the result in [19] from square tableaux to 
staircase tableaux; see Section 5. Theorem 3 is a straightforward consequence 
of Theorem 4 together with Theorem l(i). The proof of Theorem 5 employs 
geometric arguments, and relies on the characterization of 2trc[)i/2 the 
unique measure all of whose linear projections are uniform on [—1, 1]. 

Simulations. As remarked above, simulation evidence strongly supports 
Conjecture 3. The measurement (ioo(i-^ioooo! c) < 159 was obtained by using 
an exact simulation of a 10000-element USN Wiqqqq, and calculating the max- 
imum L°° distance from the configuration o"^^ at time k to a point moving 
at constant angular speed around the great circle c that passes through a^^ 
and cr~j2- Iii contrast, applying the same procedure to the "bubble sort" 
network u = {1,2, . . . ,n, 1, 2, . . . , n — 1, . . . , 1,2, 1) gives for n = 10000 
a distance of approximately 9997. It is also easy to see that the condition 
d^ujn, c„) = o(n) does not hold for every sequence of sorting networks Un- For 
example, it does not hold for any sequence of sorting networks which pass 
through the permutation ijj in Example 6, since the configuration at time 
[A/2J ^ h cannot satisfy the condition in Theorem 5(ii). 

A particularly striking illustration of Conjectures 1-3 results from plot- 
ting the graph of the permutation then viewing the animation 
as k varies. Stationarity (Theorem l(i)) implies that at any given time the 
picture will resemble Figure 2, while at time k = N/2 the initial picture will 
have been exactly rotated by ti/2. In fact (for large n) the points appear to 
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Figure 7: The evolution of the 
permutation graph of a shding 
window, modulo uniform rota- 
tion, for a uniformly chosen 500- 
element sorting network. 



rotate all at the same constant angular speed. To further illustrate this we 
may simultaneously rotate the entire picture by the (uniformly changing) an- 
gle —irk/N, and plot the resulting paths of the moving points as k increases 
from to N/2. This is shown in Figure 7. The observation that each path 
is localized is a manifestation of Conjectures 1 and 3. 

Further works. In forthcoming articles [1, 2, 3] we study several closely 
related issues. In [3] we prove further bounds on the configurations ak in the 
USN. In [2] we study the local structure of the swap process. In [1] we study 
another natural probability measure on sorting networks, in which at every 
step, a swap location is chosen uniformly from among those locations where 
the two particles are in increasing order. It turns out that this model can be 
analyzed in detail via the theory of exclusion processes. Its behaviour is very 
different from that of the USN, but it has the property, apparently shared by 
the USN (see Conjecture 1), that asymptotically each particle initially moves 
at a well-defined randomly chosen speed, and continues on a trajectory which 
is deterministic given this initial choice. 

Stretchable sorting networks. The following is one way to generate a 
sorting network. Consider a set of n points in general position in M^, and 
label them 1, . . . ,n in order of increasing x-coordinate. Now rotate the set 
of points by an angle 6. For all but finitely many 6, listing the labels of 
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the points in order of increasing x-coordinate gives a permutation in Sn- 
And if we increase 6 continuously from to vr, these permutations yield the 
sequence of configurations for a sorting network. Not all sorting networks 
can be obtained in this way; in fact those which can are exactly those whose 
wiring diagram may be drawn in the plane so that all the trajectories are 
straight lines; such networks are called stretchable — see [11] for details. (The 
smallest non-stretchable network, unique up to symmetries, is the 5-element 
example u = (1,3,4,2,1,3,4,2,1,3).) In the proof of Theorem 5 we will 
see that the assumption of that theorem implies that the sorting network is 
approximated by a stretchable network obtained by rotating a set of points 
in which approximate the Archimedes measure 2lrcf)i/2- 

Consider an n-element USN, and choose m out of the n particles uniformly 
at random, independently of the USN. If we observe only the relative order 
of these m particles then we obtain a random m-element sorting network. If 
Conjecture 3 holds then it may be deduced that, as oo with m fixed, the 
distribution of this sorting network converges to a measure whose support 
is exactly the set of stretchable m-element networks. This follows from the 
proofs in Section 8. 

Gallery. For more simulation pictures, see the gallery 
http : //www. math. ubc . ca/~holroyd/sort. 

2 Preliminaries 

In this section we present some definitions and basic results. 

Proof of Theorem \i uj = (si, . . . , Sj^) is any sorting network then it is 
easily seen that 



is also a sorting network, and furthermore that the map uj ^ uj' is a, bijection 




from Vtn to Vtn- The result follows immediately. 



□ 



We note also that 




(3) 



and 




(4) 
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are bijections from f2„ to so the measure has the corresponding sym- 
metries. 

For a permutation a G denote the inversion number 

inv((T) = j) : l<i< i <n and a{i) > cr(j)|. 

It is straightforward to see that inv(cr) is the graph-theoretic distance from 
the identity to a in the Cayley graph of Sn generated by the swaps {ri, . . . , 
Tn-i}- Hence in any sorting network we have inv(crfc) = k for all k. 

3 Young tableaux 

Young tableaux are a central tool in our proofs; we start by introducing some 
standard notation and facts. Let G N := {1, 2, . . .}. A partition of is 
a sequence A = (Ai, A2, . . . , \k) of positive integers such that Ai > A2 > . . . > 
Afc and A^ = Aj. We denote |A| := A^. We identify each partition A with its 
associated Young diagram, which is the set {{i,j) gN^ : 1 <i < k, 1 < 
j ^ Aj}. Traditionally each element (called a cell) in the diagram is 

drawn as a square, in the coordinate system with (1, 1) at the top- left and 
(1, 2) to its right. We denote the set of partitions of A^ by Par(A^). 

Two Young diagrams will play a central role: the nxn square diagram 
{n,n, . . . ,n), which we denote by and the staircase diagram (n— 1, n — 
2, . . . , 1), which we denote by A,„. 

If A G Par(A^), let A' = (A'^^, Ag, . . . , A'^) denote the conjugate partition 
to A, where d = \i and A^ = #{1 < j < k : \j > i}- The conjugate partition 
corresponds to the Young diagram obtained by reflecting the Young diagram 
of A along the northwest-southeast diagonal. 

A Young tableau of shape A, where A G Par(A^), is an assignment of 
positive integers, called entries, to the cells of A such that every row and 
column of the diagram contain increasing sequences of numbers. A standard 
Young tableau (SYT) is a Young tableau in which the numbers assigned 
to all the cells are 1, 2, . . . , A^. See Figure 8. We denote the set of SYT of 
shape A by SYT(A), and we denote d{\) = #SYT(A) (sometimes called the 
dimension of A in representation-theoretic contexts), the number of standard 
Young tableaux of shape A. Frame, Robinson and Thrall [9], [16, Sec. 1.5.4] 
proved the following formula for d{X). 
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Figure 8: The Young diagram (4,4,1), a Young tableau and a standard 
Young tableau. 



Theorem 7 (Hook formula; Frame, Robinson and Thrall). For each 
cell G A let hij{X) := Aj — j + A^- — i + 1 be the hook number of 
in A. Then 

For two Young diagrams /x, A write A ( "/i increases to A" ) to mean 
that A can be obtained from n by the addition of one cell. The Young lattice 
is the directed graph whose vertex set is U^^o-P^^(^) whose edges are 
all the pairs (/i. A) with fi y X. Standard Young tableaux of shape A are in 
bijection with paths in the Young lattice leading from the empty diagram 
to A: to the path = Aq / Ai / Aa / ■ ■ ■ / A^v = A we attach the SYT 
which records the order in which new cells were added to the diagrams along 
the path, i.e., the unique tableau T = (ii,j)(i,j)GA such that for aA\ < k < N 
we have that 

h = {{hj) e A : tij < k}. 
We call T the recording tableau of the increasing sequence of diagrams 

(Afc)o<A:<Ar- 

As an illustration of the use of the hook formula for sorting networks, we 
verify the claim of Example 6. We will use the fact (see [8, p. 135]) that 



J(n) = llf 



exp 



n + n 



\ogn + 0{n) 



as n 



oo. 



Thus, using the hook formula and Stirling's formula we can compute 

1/2 



rf(A„) 



M\i /n^ , / J(2n-2) 

(2)' _ \^2"("-i)j(n-l)2 



I n-lQn-2Kn— 3 



■■(2n-3)i 



(2n-2)! 
2"-i(n-l)! 



exp 



n 



n 



log n + {\- log 2)n^ + 0{n) 



(5) 
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J(n)(n + + 2)"-2 . . . (2n - l)i 



)■ J(n) 

^ =exp 



J(n) 



(2n-l)! 



log n + (i - log 4)ra^ + 0(r2) . (6) 



By (5) and (2) we have #fi„ = (i(A„,) (also see Section 4 below). 

For a permutation G 5*^, a partial sorting network (also called a 
reduced word) of is a sequence (si, S2, . . . , s^) such that z/ = Ts^r^g ' ' ''Ts^ 
and /c = inv(z/). Let -R(z/) denote the number of partial sorting networks of 
u. In general, evaluation of Ri^u) is a deep problem — see e.g. [10, 18]. 

Let u be any permutation, and let k = inv(z/). Then u = (si, S2, . . . , sat) is 
a sorting network passing through configuration u if and only if (si, . . . , Sk) is 
a partial sorting network for u and (s^+i, . . . , sat) is a partial sorting network 
for u^^p. Hence the probability that the USN passes through u equals 

_ i^(^)/?(^"V) 

Pu(^. - ^) ^^^^^ . (7) 

Proof of Example 6. In the case of '0, we can compute the factors in (7) above 
explicitly. We have inv('(/') = h. Firstly, R{ip) is equal to (^^2) '^(^"72)^5 since 
to get from id to ip one must reverse the particles 1, . . . , |, and independently 
reverse the particles | + 1, . . . , n, with (^^2) choices for the order in which to 
intersperse the left- and the right-half swaps. 

Secondly, we claim that the number of partial sorting networks of il)~^p = 
(| -|- 1, . . . , n, 1, . . . , |) is equal to d(\I\n/2)- This is because, given such a 
partial sorting network (si, S2, ■ ■ ■ , sn-h), we can construct a standard Young 
tableau of shape 0^/2 whose ith row lists the times ki < k2 < ■ ■ ■ < kn/2 at 
which particle i moved, and it is easy to see that this map is a bijection from 
the set of partial sorting networks of il'^^p onto SYT(n„/2)- Thus we have: 

= (,/2)c^(A„/2)^ = 2"^/^+^Wd(A„/2)^ 

R{p) = rf(A„). 

An application of the asymptotics (5) and (6) for the number of tableaux 
together with (7) verifies the claim of Example 6. Interestingly, the leading 
terms in logn cancel in the exponent. □ 
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Figure 9: The sliding sequence and the Schiitzenberger operator. Shown 
are: (a) A tableau T. In bold is the sliding sequence (obtained by starting 
from the maximum entry and repeatedly passing to the larger of the entries 
above and left); (b) the tableau obtained by sliding the entries down along 
the sliding sequence; (c) the tableau $(T). 

4 The Edelman- Greene bijection 

Stanley, who proved (2), noticed that by the hook formula the right-hand 
side of (2) is equal to (i(A„), the number of staircase shape standard Young 
tableaux of order n. Later, Edelman and Greene [5] found an explicit bijec- 
tion between SYT(A„) and Qn- This bijection will play an important part 
in what follows, so we describe it and its inverse now. 

Given a standard Young tableau T e SYT(A), where = |A|, denote by 
{imax{T) , jmax{T)) the coordiuates of the cell containing the maximum entry 
in T. 

Define the Schiitzenberger operator $ : SYT(A) SYT(A) as follows. 
Start with a tableau T = Construct the sliding sequence of 

cells Co,Ci,...,Crf e A, where Cq = (Vax(r), jmax(r)) and q = (1,1), by 
the requirements that — Cj.+i = (1,0) or (0,1) for all < r < d — 1, 
and Cr — Cr+i = (1, 0) if and only if tcr_(i,o) > ^cr-(o,i) (where we adopt the 
notational convention that for a cell {i, j) with either of i, j being non-positive 
we have tjj = — oo). Then the tableau $(T) = {t'i^j)(i,j) is defined by setting 
^'cr ~ ^cr+i -l- 1 for < r < d — 1, t'i^i=l, and t^j = tij + 1 for all other cells 
€ A. The definition is illustrated in Figure 9. It is easy to see that $ is 
a bijection of SYT(A) onto itself. 

Definition. The Edelman- Greene bijection EG : SYT{An) — ^ fin is defined 
by 

EG{T) = (jmax($^"'(T))) 

\ J k=l,...,N 

where as before N = , and denotes the kth iterate of ^. 
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It is far from obvious that the map EG is a bijection to f2„, nor what its 
inverse looks hke. It turns out that the inverse may be described in terms of 
a Young tableau construction algorithm which is a modification of the RSK 
algorithm (see [22, Ch. 7.11]). Given a sorting network u = (si, S2, ■ ■ ■ ,sn) G 
Qn, we construct a sequence of (non-standard) Young tableaux Tq, Ti, . . . , Tjv 
whose shapes = Ao,Ai,...,AAr = An form an increasing sequence of dia- 
grams, i.e., Aj y Aj+i. To get Tj+i from Tj, apply the following insertion 
algorithm to the input (Tj, Sj+i). 

Insertion algorithm. Given a Young tableau T = of shape A 

and a positive number u, construct a new tableau T' = {t'ij) whose shape is 
the union of A with one new cell, as follows. 

Step 1. (Initialize). Set A; ^ 1 and q ^ u. Set t[j <— tjj for all («, j) G N^, 
with the convention that tj^- = oo for a cell (i,i) G \ A. 

Step 2. (Find next bumping cell). Set £ to be the least positive integer 
i such that tkj > q. Set t'f,^ ^ Q- H q = t^i, set g ^ g + 1, otherwise 
set q «— tki. Set k ^ k + 1. 

Step 3. If g = oo, terminate and return the enlarged tableau T'. Otherwise 
return to Step 2. 

Definition. The inverse Edelman- Greene bijection EG~^ : f2„ SYT(An) 
is defined by setting EG~^{uj) to be the recording tableau of the sequence of 
Young diagrams \q \\ ■ ■ ■ Ajv = A„ constructed above. 



Figure 10 shows EG~^ applied to the sorting network of Figure 3. The 
following theorem justifies these definitions. The proof can be found in [5]; 
see also [7]. 

Theorem 8 (Edelman and Greene). The map EG is a bijection from 
SYT(An) to ^In, and the map EG~^ is its inverse. 

As a first application of the Edelman-Greene bijection, we prove an exact 
formula for the distribution of the first swap location si = Si(a;„) of a uniform 
n-element sorting network Un, and use it to prove Theorem l(ii). 

Proposition 9 (Swap distribution). Ifun is a uniform n-element sorting 
network, then 

pn. ^ .^1 (3-5-7---(2r-l))(3-5---(2(n-r)-l)) 

' ' iV' (2-4-6---(2r-2))(2-4---(2(n-r)-2))- ^ ' 

Proof. Let 1 < r < n — 1. By the definition of EG, the sorting networks 
Lj = {si, S2, . . . , sn) G fi„ for which Si = r are exactly the ones for which the 
standard Young tableau EG~^(ct;) has its maximum entry in the cell (n — r, r). 
Since EG is a bijection, the number of such Ci;'s is the number of SYTs of 
shape A„ \ {(n — r, r)}. Thus 

Write this using the hook formula. Theorem 7, to yield the result. □ 

Proof of Theorem l(ii). Denote an,r = IPu(si = r),l < r < n — 1. Observe 
that by Proposition 9 we have 

2 (2r)(2r)! 2(n - r)(2(ra - r))! 
~ n{n-l)' 22-(r!)2 ' 22("-0((r2 - r)!)2 

wnin — 1) 



22r 22("-^) 



Therefore, using Stirling's formula in its explicit form 1 < m!(27rm) 

< 1 + ^2m~i immediate consequence of [6, Eq (9.15), p. 54]), we get that 



1-1) I- < 

6r / \ 6(n — r 



7m(n— 1) 

< ii + ^Vi 



24r - 1 y V 24(n - r ) - 1 
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This implies easily that for all —1 < a < 6 < 1 we have 

Pu fa < — - 1 < = y anr ^ - I Vl^dt, 

^ ^ t(a+l)<r<t(fe+l) 

as required. □ 



5 Limit profile for staircase tableaux 

For any Young diagram A we write Pa for the uniform measure on the set 
SYTiX) of standard Young tableaux. It is natural to consider the limiting 
behaviour of a random tableau of distribution Pa„ for a sequence of diagrams 
(A„) of a given shape and increasing size. For general shape, the problem 
of rigorously determining the complete limiting profile is open (see, however 
[14, 15] and [4, Theorem 1.5.1]). An exception is the square diagram 
where the problem was solved by Pittel and Romik [19]. In this section we 
use their result to derive a solution for the staircase diagram A„. 

We start by stating the main result from [19]. It will be convenient to 
use the following coordinate system. If (i, j) is a cell of then its rotated 
(and scaled) coordinates are 

n n 
(note that this differs from the coordinate system in [19] by a factor of ^/2). 

We define the following functions, which will describe the limiting profile. 
For a G [0, 2] the function ha : [—^ya{2 — a), a/«(2 — a)] — > [0, 1] is defined 
by 



, , , 2 r fu\ -1 ri" 

ha\u) := — u arctan I — ) + arctan it 

(9) 

for a G [0, 1] (where tan ^ oo := 7r/2 giving hi = 1), and by 

h2-a{u) := 2 - ha{u) 

for a G (1,2]. The curve v = ha{u) will approximate the level-(a?7,^/2) 
contour of the tableau; Figure 11 shows some of these curves. The function 
L : [0, 1] X [0, 1] [0, 2] is defined implicitly by 

/ u+v v-u\ 
L I , = a <^=^ ha[u) = V. 



where R 



\/ a(2 — a) — 

1 — a ' 
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The following result [19, Theorem l(i)]) gives the limiting profile for uni- 
form square Young tableaux. 

Theorem 10 (Limit profile for square tableaux; Pittel and Romik). 

Let Fo„ be the uniform measure on Young tableaux G SYT(Dn)- 

For any e > 0, 



Pi 



On 



max 

{j,i)Gn„ 



2s 



-L 



1 1 

n' n 



> e 



0. 



We shall deduce the following analogous result for the limit profile of a 
staircase tableau, where the function L is the same as above. 

Theorem 11 (Limit profile for staircase tableaux). Let Pa„ be the 

uniform measure on Young tableaux (tjj)(i,j)6A„ £ SYT{An). For any e > 0, 



P/ 



max 



2t- ■ 



n n 



> e 



0. 



Thus the limit profile for the staircase tableau is the same as that for 
half of the square tableau. Other results in [19] give explicit bounds on 
deviations from the limit profile, but only in the interior of the square. These 
estimates may be translated to staircase tableaux as well. However, uniform 
convergence in probability is sufficient for our purposes. It is important that 
Theorem 11 includes the boundary of the diagram. 
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Our main tool in proving the above is the following general result con- 
cerning continuity of random tableaux in the shape. For Young diagrams 
A, fi we write A C /i if this relation holds for A, fi as subsets of N^. 

Theorem 12 (Coupling). Let X C fi be a pair of Young diagrams. There 
exists a coupling of the measures ¥^ on S = {si,j) G SYT{fi) and Fx on 
T = (tij) G SYT{X) such that for all (z, j) G A 

^ + 1/^ \ -^l- 

To prove Theorem 12 we will make use of an algorithm from [12] for 
sampling from Pa- First note that, in order to simulate a tableau with 
distribution P^, it suffices to be able to choose the location Cmax = (Vax, jmax) 
of the maximum entry |A| with the correct distribution. For then, after 
inserting this entry, we may iteratively apply the same algorithm to the 
smaller diagram A \ {cmax} to locate the second largest entry, and so on. 

For a cell {i,j) G A, define its hook to be the set 

ff(,,,)(A) := {{k,j) eX:k>i}u {(z. A;) G A : A; > j}. 

The location Cmax of the maximum entry may be simulated using the following 
hook walk algorithm from [12]. 

Hook walk algorithm. Given a Young diagram X, choose a random se- 
quence of cells cq, . . . ,Cr iteratively as follows. 

Step 1. Choose a cell cq uniformly at random from A. 

Step 2. Given that cells cq, . . . , Ck-i have been chosen, choose uniformly 
at random from the hook Hc^._j^{X). 

Step 3. Repeat Step 2 until we obtain a cell Cr with ^Hc^{X) = 1, then 
stop. 

Theorem 13 (Hook walk; Greene, Nijenhuis and Wilf). The random 
final cell Cr constructed by the hook walk has the same distribution as Cmax 
under P^. 

Lemma 14 (Domination). Assume A C ^u, and let X' := X \ {cmax(A)} and 
fi' := fi\ {cinax(/u)} be the random Young diagrams obtained by removing the 
largest entry in the respective uniform standard Young tableaux. Then we 
have the stochastic domination X' fi' . 
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Proof. Consider the hook walk apphed to A and /i. It is enough to couple 
the two hook walks so that either they stop at the same cell, or the walk 
in fi stops at a cell in /i \ A. This will hold provided the two walks coincide 
until the first time the one in /i enters \ A. And this can be achieved as 
follows. Run the hook walk in fi according to the usual rules. Let the walk 
in A be identical to that in fi while the latter is in A. If and when the walk 
in /i jumps to a cell in /i \ A, continue the walk in A according to the usual 
rules for A using an independent source of randomness. It is easy to see 
that this gives the correct hook walk terminating probabilities for Cmax(A). 
(A key observation is that a uniformly chosen element in the hook 
conditioned to be in .,)(A) is distributed uniformly in j)(A)). □ 

Proof of Theorem 12. Construct the random tableaux S*, T iteratively by first 
choosing the maximum entry in each, then the second largest, and so on. Do 
this using the hook walks, and at each stage couple the two hook walks 
according to Lemma 14, so that the remaining unfilled Young diagrams are 
always ordered. Let m = A|. At the step when k is entered into A, say at 
location (z, j), the entry A; + m is entered into /x, and all subsequent entries 
entered into /i are < k + m. By the ordering property one of those subsequent 
entries (possibly k + m) will be at the cell Therefore Sij < k + m = 

tij + m. □ 

Proof of Theorem 11. Fix e > 0, and consider a random square tableau 
S = (sij) with law Pn„. Let /i be the random Young diagram obtained 
by removing the cells with entries greater than (1/2 + e)n'^ from □„,and de- 
fine the event = {A„ C /i}. From Theorem 10 we find that P(A„) — s> 1, 
because L = 1 along the diagonal. 

Note that, conditional on /i, the tableau obtained by restricting S to ft 
has law P^. Also let T = (tj j) have law Pa„- Theorem 12 implies that S 
and T can be coupled so that on the event An we have Sij < tij + n'^e for 
G A„. Thus by the square diagram result. Theorem 10: 



as required. The corresponding upper bound is similar: instead we begin by 




removing the entries greater than (1/2 — e)n'^. 



□ 
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We extract the following consequences of Theorem 11 for use in the later 
proofs. 

Corollary 15 (First row). For a staircase tableau T = (tij) let Rk = 
Rk{T) := max{j : ti j < k} be the number entries < k in the first row. For 
any e > we have 



Pa„ ( max 







where di 



n 



^ (n _ k_\ 

N \^ NJ- 



Proof. Theorem 11 easily implies that for any S, with probability tending to 
1 as n — s> oo, 



max 

j 



f-HK 



J 



< 5. 



(10) 



From the definition of L, the map x i— > L(0, x) is continuous and strictly 
monotone, and satisfies L(0, a/«(2 — a)) = a for a G [0, 1]. Therefore given 
any e > 0, we can choose 6 > such that for all x, a G [0, 1], 

|L(0,x) — a\ < 6 implies \x — \/ a{2 — a) | < e. 

We deduce that on the event (10) we have 

max I j — dt-i^ ] < en. 
j 

Since I— > (ifc is strictly monotone this implies that max^ \Rk — dk\ < en. □ 

Corollary 16 (Contours). Fix some a G [0, 1], and let Ti^ = TiaiT) be the 
set of entries in a staircase tableau T in the cells where v > ha{u). For any 
e > we have the following bound on the symmetric difference: 



P 



#(7^, A{[aiVl,...,iV}) >eN 



0. 



Proof. Fix e > 0. By the continuity and strict monotonicity of the function 
L, we may choose 6 > such that the area of the region D := {{u,v) : 
ha~2s{u) < V < ha+25iu)} is at most e. Then for n sufficiently large, on the 
event 



max 

(*,i)GA„ 



2ti, 



n^ 



1 L 

ra' n 



< (5, 



we have that all entries in the symmetric difference Tia A { \aN^^ , . . . , TV} lie 
in D, so the result follows from Theorem 11. □ 
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6 Law of large numbers 

This section contains the proof of Theorem 2. Recall the semicircle measure 
scmi{dx) = — dx. Fix some interval [a,b] C (—1,1), and 

define for < s < t < 1 and a sorting network uj: 

Ss,t{^) ■■= #{siV <k<tN : fsfcM - 1 G [a, 6]}. 

We will deduce Theorem 2 from the following. 

Lemma 17. Fix an interval [a,b] C (—1,1). For any 6 small enough (de- 
pending on a, b), 

Fl(\Sos -SNstmi[a,b] \ > 8N6^) > 0. 

Proof of Theorem 2. It suffices to prove that for any [a, b] C [—1,1] and for 
any e > and 0<s<t<lwe have 



^Ss,t - {t- s)semi[a, b] > e > 0. (11) 



Since the total number of swaps is deterministically A^, it is enough to prove 
this in the case [a,b] C (—1, 1). We deduce this from Lemma 17 as follows. 
Fix some positive integer m to be chosen later, split the time interval [s, t) 
into m smaller intervals of length 6 := — , and define the events 



Bk := {|5,+fc5,,+(fc+i)5(tu) - 6Nstmi[a,b]\ > 9N6^} , < A; < 



m. 



Let B = IJ-Bfc. By stationarity of the swap location process (Theorem l(i)), 
each of the random variables Ss+k5,s+{k+i)5 is within ±1 of a random variable 
having the same law as 6*0,5. Hence by Lemma 17, 



m—l 



FUB)<J2n(Bk) ^0. 

' ^ n. — >-rxn 



fc=0 

If B does not occur then the quantity 

1 

k=0 
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satisfies 



S. 



— m5Nstm\[a, h] | < dmNS 



2. 



s. 



i.e., since mS = {t — s) 



j^Ss,t -{t- s)semi[a, b] | < 9(t - s)6 < 96. 



Now (11) follows by setting m large enough that 96 < e. 



□ 



Proof of Lemma 1 7. By stationarity of the swap process, the first 6N swaps 
and the last 6N swaps have the same law. The idea of the proof is now as 
follows. From the Edelman-Greene bijection we see that the last 6N swaps 
are determined by the locations of the 6N largest entries in the staircase 
shaped Young tableau T corresponding to u. By Corollary 16, the set of 
these locations is almost deterministic, which will imply our claim. 

For a Young tableau T, consider Jmax('^'^r)- To find it we start with the 
element N — k in T, and perform k iterations of $. At each iteration, the 
entry increases by 1, and possibly moves one square towards the diagonal. 
If it started close to the diagonal, it can only hit the diagonal in a limited 
region. In particular, if N—k started in region A of Figure 12 then necessarily 
{^jmaxi^^T) — l) G [a,b]. Similarly, if it started in either of the regions 
labelled C then it will not exit through that interval, li N — k started in the 
region marked B, then whether or not it exits in the interval [a, b] depends 
on locations of other entries in the tableau. 

Let u = EG{T), and let As{T) be the number of entries greater than 
(1 — 6)N in region A of T, and similarly define Bs{T) with region B. We 



To prove the lemma we show that with probability tending to 1, for a 
uniformly random tableau T, Bs(T) and [^^(T) — 5A^semi[a, 6] | are both 
of order 6'^N. Here we use Corollary 16, with e of the corollary equal to 5^. 
Consider the tableau T in the (m, v) co-ordinate system of Section 5 as shown 
in Figure 12. Let H be the region where v > hi^s{u) for the function defined 
in (9) (the shaded region in Figure 12), and let Ti. be the set of entries of T 
in H. 

Corollary 16 states that with probability tending to 1, 



find 



0<SoA^)-MT)<Bs{T). 



(12) 
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-1 « ^ 1^ 



Figure 12: A Young tableau in the {u,v) co-ordinate system. Entries in 
region A can only exit through the interval [a,b], while entries in C cannot, 
and entries in B may or may not. The region where v > hi_s{u) is shaded — 
this is the typical location of the entries greater than (1 — 6)N. 



This implies that with probability tending to 1 we have 



As{T) - NC{HnA) 



< 6^N, 



Bs{T)-NC{HnB) 



< 6^N, 



where C denotes area of sets in the {u, v) plane. Applying these to (12) we 
find that (with probability tending to 1) 



So,s{uj) - NC{H nA) < (25^ + C{H n B)) N 



So, we need to estimate the areas of H (1 A and H H B. 

We use the Taylor expansion of hi-s{u) around 5 = 0, which is 



(13) 



hi^i{u) = 1 — '^yl^^vt? 5 + 0{5^) as 5 — > 0, uniformly in n G [a, h] 

(see [19, formula (7), p. 13]). To estimate C{HnB) note that the side length 
of each of the two "triangles" comprising fl i? is of order 6. Indeed, each 
is contained in a square with diagonal -6, and so 



1 



It remains to estimate C{H fl A). The fact that ^hi^s{u 
that for 6 small enough (depending on a, b), for any u G [a, b] the error term 



s=o 



(14) 



implies 
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in the Taylor expansion is at most 5"^. Consequently, the area oi H p[ A can 
be estimated by integrating: 



C{HnA)-5 l^fV^du 

J a 



< {b - a)6^ + C{H n B) <46^ 



(The C{H n B) term comes from the truncation near a and b.) 
The result follows by applying (14) and (15) to (13). 



(15) 



□ 



7 Octagon and Holder bounds 

In this section we prove Theorems 3 and 4. 

Lemma 18. Let TZk = TZk{uj) = (Afc)i be the length of the first row of the 
Young diagram Xk created by the first k steps of the EG~^ algorithm from a 
sorting network uj. Then (^^^{i) — i < Tlk for all i, k. 

Proof. The first row of the recording and insertion tableaux during the EG~^ 
algorithm behave exactly the same way as during the celebrated RSK algo- 
rithm. It is well known (see [22]) that the RSK algorithm applied to any 
sequence of numbers creates a tableau whose first row has length given by 
the longest increasing subsequence (check this on Figure 10). 

Here we only need the upper bound, which for completeness we verify 
here. Observe that the entries of the first row cannot increase as the EG~^ 
algorithm proceeds, as they only change through bumping which replaces 
an element by something less or equal. So given any increasing subsequence 
Oi, . . . , a£, we know that each has to be inserted to the right of where Oj-i 
was inserted. This shows that at any step k the length of the first row is at 
least the length of the longest increasing subsequence of the input so far. 

Now by time k the position of particle i has changed by ?7 = cr^'^{i) — i. If 
< then the required statement is vacuously true, and if r/ > then this 
implies that swaps at positions i, . . . ,i + rj — 1 have appeared in this order 
(with possibly other swaps in between). Thus r] < TZk, as required. □ 

Proof of Theorem 4- We use Lemma 18 and Corollary 15. With the notation 
there, the Edelman- Greene bijection (Theorem 8) shows that if a; = EG{T) 
then TZk{uj) = Rk(T). Therefore Corollary 15 gives for any e > 0, 

F^Ij {\fk ■.nk<dk + en) > 1, 
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and by Lemma 18 we deduce 

P^^ (Vj, k : a~\j) -j<d, + en) > 1. 

n— ►oo 

Since ak is a permutation this is equivalent to 

Pu (Vz, k:i- (Tfc(z) < 4 + en) > 1. 

The symmetries (3) and (4) now imply the other three required bounds. □ 
Proof of Theorem 3. Part (i). Fix e > 0. Consider the event 

E = jvz, VO < J < A: < iV : \ar^(z) - a^\z)\ < n^{k - jY'^ + ^| 



We will show that 



1. 



(16) 



This will be enough, since the effect of linear interpolation is negligible (that 
is proving (16) for all e > implies the required statement for all e > 0). 
Denote M = and K = [^\ ■ For each integer < < , denote the 

event 



E„ 



(^vMi'^)\ < -n + n 



2\k-vM\ 
N 



1/2 



By Theorem 4 together with stationarity, Theorem l(i) we get F\j{E^) 1. 
Since the number of these events is fixed, we deduce 



Pi 



n 



0<?;<A' 



1. 



We claim that if G no<t)<A' then u E E. For each < j < k < N 
consider two cases. First, it is possible that there is some < v < K such 
that vM < i < k < {v + 1)M. In this case, u E Ey implies that for all i 



1^, W 



2\j - vM\ 
N 



1/2 



^ e e e /-,^,^ 

< -n H — n = -n, 17 
- 8 8 4 ' ^ ^ 



where we used the fact that j — vM < Similarly, 
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and therefore e e e 

1^7^(0 - ^fe ^(01 <^n + -n = -n. 

The second possibihty is that for some Q <v < K we have that {v—l)M < 
j < vM < k. In that case, (17) is still true, and furthermore since u & Ey 
and k — vM < k — j we get 

Combining this with (17) gives 

Wj\^)-^,\^)\<'^n + n^{k-jy/', 

as claimed. 

Part (a). For some fixed sequence — > 0, consider the set of contin- 
uous functions T : [0, 1] [—1, 1] satisfying 

Vt,sG[0, 1] : |T(t) -T(s)| < V8|t- s| 

By part (i) we can choose ^ so that P(Tj(,„) G v4„) 1. 

Let u;(T, /i) = sup{|T(t) - T(s)| : \t - s\ < h}. It follows from P(Ti(„) e 
An) that 

lim lim sup E(w;(Tj(.„), /i) A 1) = 0. 

By [13, Theorem 16.5] we have tightness of the random sequence Tj(„) under 
this condition (note that the target space [—1, 1] is compact). This establishes 
the existence of subsequential limits. 

Now if we have a weakly convergent subsequence Tj(„(j)), then it must 

have the same limit as the conditioned random variables := {Tnn{j)) I 
Ti{n{j)) ^ ^n(j))- We may realize the sequence {T^} on the same probability 
space so that — ^ T a.s. [13, Theorem 4.30]. We conclude by observing 
that any limit of deterministic paths G Ani^j^ is Holder(A/8, \). □ 

8 Great circles 

In this section we prove Theorem 5. The idea is as follows. If a sorting 
network lies close to a great circle then its trajectories are close to sine curves 
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up to some time change. Equivalently, it is close to a stretchable network 
obtained by rotating a set of points as in the remark in the introduction. 
This set of points must have roughly uniform one- dimensional projections 
in all directions, so its empirical measure must be close to 2lrcf)i/2- Finally, 
since the inversion number of the resulting configurations is close to linear in 
the angle of rotation, the time change mentioned above must be linear. 
Here are the details. Denote the centre of S„ by c = (^^, . . . , and 

the radius by i? = \J~^^- Given the circle c„ we may choose a pair of 
orthogonal vectors u, v of length R so that the circle has the representation 

Cn = {cn{0)}e<m where 

Cn{0) = c + u COS ^ + V sin 6. 

For k E {0, . . . , A^}, define a sequence 6k (up to addition of multiples of 

2n) by 

9k = argmin \\a^^ - Cn{9)\\oc- 

6 

Thus CniOk) is the point of c„ closest in to a^^ . W.log. we may choose 
u so that ^0 = (this leaves us two possibilities for v). For other /c, the 
angle 6^ is uniquely determined inductively by requiring \Ok+i ~ < ^r. By 
symmetry, O^^ = (2k + l)7r for some integer k (and we will see that in fact 
k = 0). 

Fix some e > 0. The condition on Un implies that for n large enough 
(depending on e), 

Ikfc ^ - Cn(4)||^ < en for all k. (18) 
Since ||cr^"^ — cr^^';]^ ||oo = 1, this implies that 

\\cn{Ok+i) - CniOk)\\^ < 1 + 2en. 
Since R ~ n^^^, simple geometry implies that for n large enough we have 

\Ok+i - Ok\ < 2arcsin (^iit^^i^^ < 8e, 

for all k (the ^/n term comes from passing from the L°° norm to the 
norm). Thus {6k} does not change too quickly. In particular, there must be 
some k so that either 16*^ — 7r/2| < Ae, or \6k + 71/21 — '^e. We can negate v, 
so w.log. assume the former is the case. 
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Considering the ith coordinate in (18), one finds that 



(Tk\^- {^ + Ui cos Ok + v, sin Ok) <en. (19) 

We would hke to show that the sorting network is approximated by motion 
along the circle with constant speed, i.e. that 6^ ~ nk/N. If that were the 
case, part (i) of Theorem 5 would follow. As it is, (19) only implies that the 
paths are approximately sine curves up to a time change. The key point here 
is that the same time change applies to all particles. 
Define a probability measure Un on by 



n 



1 " 



where 6{x, y) is the delta measure at (x, y). Thus z/„ is the empirical measure 
for the (rescaled) coordinates of u and v. 

Lemma 19. With the above notations we have the vague convergence =^ 

Proof. We first claim that z/„ is supported inside the disc of radius 2. Indeed, 
the vector c + u approximates the identity permutation, and so (by (18)) all 
entries of -u are in [—1 — Se, 1 + Se]. For v, note that there is some k so that 
l^fc ~ 7r/2| < 4:6, we find that c + v approximates , with some additional 
error from ucos6'fc. Thus the coordinates of -v are all in [—1 — 6e,l + 6e]. 

We use the continuity theorem for the multi-dimensional characteristic 
function, (see e.g. [13, Theorem 5.3]). Thus it suffices to prove pointwise 
convergence of the characteristic function of Un to the characteristic function 
of 2lrcf)i/2- This in turn will be deduced from considering the one-dimensional 
projections of Un. 

More precisely, (19) says that for n large enough, for all k and i, 



{^Ui cos Ok + fwisin^fc) - {^ai^\i) - l) 



< 3e. 



This states that the projection of Un in direction 6k can be coupled to the 
empirical measure of a permutation scaled to [—1, 1] so that they differ by at 
most 2e. But the scaled empirical measure of a permutation consists of equal 
point masses along an arithmetic progression, and does not depend on the 
permutation. Let F{x) := OV A 1 be the distribution function of uniform 
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measure on [—1, 1]. If P0(a, b) = acos6 + 6sin^ denotes projection on a line 
in direction 6, then we deduce that for n large enough, for any x G M, 



< 3£ + - < 4e. 

n 



(20) 



For an arbitrary angle 6, there is necessarily some k such that either 
l^fc — ^1 < 46, or the same holds for 9 + 7i. Fix x G [— 1, 1]. Note that for two 
angles 4>,ip, and any z G we have \P(j)Z — P^z\ < \z\\(f) — tpl- Since Un is 
supported inside the disc of radius 2, PeUn is close to Pe^^n, and so 



{Peyn){-oo,x] - (Pe^z/„)(-oo,x] 



< 8e. 



(21) 



Combining (20) and (21) gives that for n large enough (depending on e), for 
all 6 and x G [—1, 1], 



{PgUn){-00,x] - F{x) 



< 12e. 



By monotonicity of cumulative distribution funcitons, the same bound holds 
for 1 < |x| < 2. Since the support of f„ is bounded in the disc of radius 2, 
for X outside [—2,2] we have the stronger identity (Pez/„,)(— oo, x] = F{x). 

We wish to compare the characteristic functions </> and (pn of 2lrcf)]^/2 
Un respectively. Note that for any 6, the measure Pg^xci)i^2 is the uniform 
measure on [—1,1]. We have that 



- (f)n){r cos 6, r sin 6) = e*''''P6i(z/„ - 2trcf)i/2 

Jr 



){dx) 



Integrating by parts gives 



bn){r cos 6', t sin ( 



< 



tre 



[Pgiur, -2lrc()i/2)](-oo, 



x\ 



dx 



y"^2|[P,(i/„-2lrcl3i/2)](-oo, 



< 

< 96e. 



x\ 



dx 



Since e can be arbitrarily small, this proves pointwise convergence of the 
characteristic functions, and therefore convergence of Un to 2lrcf)]^/2- ^ 
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Lemma 20. With the above notation, 



max 

k 



(71. — 

^ N 







Proof. Fix some 9, and let Pg denote the projection on a line in direction 9, 
so that 

Pe (x, y) = X cos 6* + y sin 6^, 

and consider the permutation Pn{9) derived from u, v by arranging i G 

in increasing order of Pg{ui,Vi). We first estimate the inversion number 

inv(p„(6')). Define 

We have 

-^inv(p„(6')) = jj dun dun, 
which is a continuous functional of Un- Since Un =^ 2lrcP)i/2) this implies 



^inv(p„,(6')) > [ [ lA{e) d^tc[}y2 dQltci)^/2 = - 



To check the last equality, note that the integral is the probability that the 
x-projections of two points chosen independently from 2lrc[)i/2 change order 
after rotation by at most 9. By rotational invariance the angle of the line 
between two such points is uniform on [0, vr]. 

Equation (22) holds for any fixed 9. However, inv(p„(6')) is increasing in 
9 G [0,7r], and consequently, 

1 9 

—imr{pn{9)) > — uniformly in 6^ G [0, vr]. (23) 

TV n— ♦oo TT 

Comparing (19) for i,j we find that for any e > 0, for n large enough we 
have 

0-^^(2) - cTfe ^(j) = {ui - Uj) COS 9k + {vi - Vj) sin 9k + 6 (24) 
with \5\ < 2en for all i,j. Say a pair {i,j) is ^-uncertain if 

\{ui - Uj) cos9k + {vi - Vj) sm9k\ < 2e. 
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So, for any i < j we have that > if and only if [pn{6k))[i) > 

{Pn{(^k)){j), unless (i, j) is ^fc-uncertain. Recall that any such pair i < j with 
"^^^(0 > ^k^U) contributes 1 to the number of inversions of cr^^ (hence, 
if is not 6'fc-uncertain, also to inv(p„(6'fc))). Consequently, inv(p„(^fc)) 
differs from inv(cT^^) = /c by at most the number of ^fc-uncertain pairs. 

It remains to bound the number of 6'-uncertain pairs. Fix e > 0, and 
consider the set S^i, of all strips of width 4e in M^. Since z/„ =^ 2trc[)^/2 5 
have 

limsup sup iyn{A) < 2e 

because 2trc[)i/2(^) ^ for any such strip. This implies that for large n 
for any i there are at most 2en values of j such that is 6'-uncertain for 
some 6. In summary, for n large enough, depending only on e, and any 6, 
the total number of ^-uncertain points is at most 2£?7,^. 

Combining (24) and the above discussion we find that for large n 

|inv(crfc) - inv(p„(6'fc))| < 2en'^, 

uniformly in 6. Since inv(crfc) = k, combining with (23) yields the result. □ 
Proof of Theorem 5. Combining Lemma 20 and (19) gives part (i): 

max |(^cT[fAr](z) - 1) - l{uiCos{Txt) + ViS\n{'Kt))\ > 0. (25) 

In particular, we have 2ui/n = 2i/n — 1 + o(l). By inserting (25) into the 
definition of Ht in (1), we find that UtijjJn) is close to Rti'm where Rt is the 
linear map -Rf (x, y) = (x, x cos(7rt) +?/ sin(7rt)), in the sense the two measures 
can be coupled with maximal distance tending to 0. Since z/„ =^ 2trcf)]^/2) 
this implies ^itil^n) =^ Slrcf)^, which is (ii). 

Next, we prove (iii). To sample from the scaled swap process ri{ujn) one 
may choose uniformly a pair of particles i,j and consider the time and lo- 
cation of their swap. Consider the pair of points Zi = ^(ui,Vi) and Zj = 
^{uj, Vj). If i,j are swapped at step k of the network, then o"^^(i) — ^(j) = 
1, so by (24) we have for n large enough that \Pe^{zi) — Pg^{zj)\ < 2e, and by 
(19) the scaled location of the swap is given to within 2e by Pe^Zi. Thus for 
any i, j, the time of the (i,j) swap is given by the angle of a certain line, and 
the location of the swap by the distance of the line from the origin, where 
this line passes within distance e of both Zi and Zj. Thus, unless 
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sufficiently close, the location of the two points approximately determines 
the time and place of the swap. 

Specifically, for any pair z' , as e — *• the set of possible times converges 
to a single time, and the set of possible locations converges to a single loca- 
tion. Since Un =^ 2trc[)i/2! follows that r]{un) converges to the measure 
resulting from applying the same operation to 2lrcf)i/2- 

Let z,z' G be chosen independently with law 2trcf)]^/2- Let 9 G [0,7r] 
be the angle that the line through them makes with the positive y-axis, and 
let r := zi cos6 + Z2 sin 6' be its signed distance from the origin. It remains 
to prove that 6 and r are independent, 6 is uniform in [0, vr] and r has law 
semi. Independence and uniformity of 6 are clear by rotational symmetry of 
2trcf)^/2- Finally, to calculate the distribution of r we introduce some further 
variables. Let z, z* be z, z' rotated by —9 and let 2 = (r, y) and z' = (r, y') be 
their coordinates. Let w = y/\^l — r"^ and w' = y' /Vl — r^- Thus we have 



Zi = r cos 9 — wVl — sin 9 
Z2 = r sin 9 + wVl — cos 9 
z[ = r cos 9 — w'Vl — r"^ sin 9 



z: 



'r, = r sin 9 + w'Vl — t"^ cos 9. 



We can compute the probability density function of r using the Jacobian of 
the transformation {zi, Z2, z'l, Z2) ^ {r,9,w,w'); after some straightforward 
manipulation we obtain 



1 /.I 



d{zi, Z2, z[, z'2) 



1 



27r^l - ll^ll^ 27rv/r 



I r'l|2 

F II2 



dw dw' d9 



2 



d{r, 9, w, w') 

as required. □ 
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